Smart Paper Notes

Removing the fat from your posterior samples with margarine

Harry T. J. Bevins , William J. Handley , Pablo Lemos , Peter H. Sims , Eloy de Lera Acedo , Anastasia Fialkov , Justin Alsing

Category: Machine Learning

2022-05-25

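Bayesian workflows often require the introduction of nuisance parameters, yet core science modelling needs access to a marginal posterior density. In this work we use masked autoregressive flows and kernel density estimators to encapsulate the marginal posterior, allowing us to compute marginal Kullback-Leibler divergences and marginal Bayesian model dimensionalities, in addition to generating samples and computing marginal log probabilities. We demonstrate this on topical cosmological examples from the Dark Energy Survey and on global 21cm signal experiments. Beyond the computation of marginal Bayesian statistics, this work is important for further applications in Bayesian experimental design, complex prior modelling, and likelihood emulation. The technique is publicly available in the pip-installable code margarine.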
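
The marginal statistics named in this abstract can be illustrated with a small sketch. It does not use margarine's API or a masked autoregressive flow. It assumes a toy two-parameter posterior, a hand-written 1D Gaussian kernel density estimator with a Silverman-style bandwidth, and a uniform prior on [-1, 1]. The Kullback-Leibler divergence is estimated as the sample mean of log(P/π), and the Bayesian model dimensionality as twice the sample variance of log(P/π). All names and numbers below are illustrative assumptions.

```python
import math
import random
import statistics


def gaussian_kde_logpdf(samples, bandwidth):
    """Return a function giving the log-density of a 1D Gaussian KDE."""
    norm = len(samples) * bandwidth * math.sqrt(2.0 * math.pi)

    def logpdf(x):
        total = sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples)
        return math.log(total / norm)

    return logpdf


rng = random.Random(0)
n = 500

# Toy joint posterior samples: (parameter of interest, nuisance parameter).
joint = [(rng.gauss(0.0, 0.1), rng.gauss(0.0, 0.3)) for _ in range(n)]

# Marginalising over the nuisance parameter means dropping its column.
theta = [t for t, _ in joint]

# Silverman-style rule-of-thumb bandwidth (an assumption of this sketch).
bandwidth = 1.06 * statistics.pstdev(theta) * n ** (-0.2)
log_post = gaussian_kde_logpdf(theta, bandwidth)

# Assumed marginal prior: uniform on [-1, 1], so the density is 1/2.
log_prior = -math.log(2.0)

# log(P/pi) evaluated at the posterior samples.
log_ratio = [log_post(t) - log_prior for t in theta]

marginal_kl = statistics.fmean(log_ratio)          # KL = E_P[log(P/pi)]
marginal_bmd = 2.0 * statistics.pvariance(log_ratio)  # d = 2 Var_P[log(P/pi)]

# Analytic KL for a Gaussian of width 0.1 against the same uniform prior.
analytic_kl = math.log(2.0) - 0.5 * math.log(2.0 * math.pi * math.e * 0.1 ** 2)
```

Because the density estimator is fitted only to the column of interest, the nuisance parameter never enters the calculation, which is the point of working with the marginal posterior.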

Translated by Google Translate

Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting

Su Wang , Chitwan Saharia , Ceslee Montgomery , Jordi Pont-Tuset , Shai Noy , Stefano Pellegrini , Yasumasa Onoe , Sarah Laszlo , David J. Fleet , Radu Soricut

Category: Computer Vision | Artificial Intelligence

2022-12-13

Text-guided image editing can have a transformative impact in supporting creative applications. A key challenge is to generate edits that are faithful to input text prompts while remaining consistent with input images. We present Imagen Editor, a cascaded diffusion model built by fine-tuning Imagen on text-guided image inpainting. Imagen Editor's edits are faithful to the text prompts, which is accomplished by using object detectors to propose inpainting masks during training. In addition, Imagen Editor captures fine details in the input image by conditioning the cascaded pipeline on the original high-resolution image. To improve qualitative and quantitative evaluation, we introduce EditBench, a systematic benchmark for text-guided image inpainting. EditBench evaluates inpainting edits on natural and generated images, exploring objects, attributes, and scenes. Through extensive human evaluation on EditBench, we find that object-masking during training leads to across-the-board improvements in text-image alignment, such that Imagen Editor is preferred over DALL-E 2 and Stable Diffusion. As a cohort, these models are better at object-rendering than text-rendering, and handle material/color/size attributes better than count/shape attributes.

Exploring Randomly Wired Neural Networks for Climate Model Emulation

William Yik , Sam J. Silva , Andrew Geiss , Duncan Watson-Parris

Category: Machine Learning

2022-12-06

Exploring the climate impacts of various anthropogenic emissions scenarios is key to making informed decisions for climate change mitigation and adaptation. State-of-the-art Earth system models can provide detailed insight into these impacts, but have a large associated computational cost on a per-scenario basis. This large computational burden has driven recent interest in developing cheap machine learning models for the task of climate model emulation. In this manuscript, we explore the efficacy of randomly wired neural networks for this task. We describe how they can be constructed and compare them to their standard feedforward counterparts using the ClimateBench dataset. Specifically, we replace the serially connected dense layers in multilayer perceptrons, convolutional neural networks, and convolutional long short-term memory networks with randomly wired dense layers and assess the impact on model performance for models with 1 million and 10 million parameters. We find average performance improvements of 4.2% across model complexities and prediction tasks, with substantial performance improvements of up to 16.4% in some cases. Furthermore, we find no significant difference in prediction speed between networks with standard feedforward dense layers and those with randomly wired layers. These findings indicate that randomly wired neural networks may be suitable direct replacements for traditional dense layers in many standard models.

SODA: A Natural Language Processing Package to Extract Social Determinants of Health for Cancer Studies

Zehao Yu , Xi Yang , Chong Dang , Prakash Adekkanattu , Braja Gopal Patra , Yifan Peng , Jyotishman Pathak , Debbie L. Wilson , Ching-Yuan Chang , Wei-Hsuan Lo-Ciganic

Category: Natural Language Processing | Artificial Intelligence | Machine Learning

2022-12-06

Objective: We aim to develop an open-source natural language processing (NLP) package, SODA (i.e., SOcial DeterminAnts), with pre-trained transformer models to extract social determinants of health (SDoH) for cancer patients, examine the generalizability of SODA to a new disease domain (i.e., opioid use), and evaluate the extraction rate of SDoH using cancer populations. Methods: We identified SDoH categories and attributes and developed an SDoH corpus using clinical notes from a general cancer cohort. We compared four transformer-based NLP models to extract SDoH, examined the generalizability of NLP models to a cohort of patients prescribed opioids, and explored customization strategies to improve performance. We applied the best NLP model to extract 19 categories of SDoH from the breast (n=7,971), lung (n=11,804), and colorectal cancer (n=6,240) cohorts. Results and Conclusion: We developed a corpus of 629 cancer patients' notes with annotations of 13,193 SDoH concepts/attributes from 19 categories of SDoH. The Bidirectional Encoder Representations from Transformers (BERT) model achieved the best strict/lenient F1 scores of 0.9216 and 0.9441 for SDoH concept extraction, and 0.9617 and 0.9626 for linking attributes to SDoH concepts. Fine-tuning the NLP models using new annotations from opioid use patients improved the strict/lenient F1 scores from 0.8172/0.8502 to 0.8312/0.8679. The extraction rates among the 19 categories of SDoH varied greatly: 10 SDoH could be extracted from >70% of cancer patients, but 9 SDoH had a low extraction rate (<70% of cancer patients). The SODA package with pre-trained transformer models is publicly available at https://github.com/uf-hobiinformatics-lab/SDoH_SODA.

Temporally Extended Successor Representations

Matthew J. Sargent , Peter J. Bentley , Caswell Barry , William de Cothi

Category: Machine Learning | Artificial Intelligence

2022-09-25

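We present a temporally extended variation of the successor representation, which we term t-SR. t-SR captures the expected state transition dynamics of temporally extended actions by constructing successor representations over primitive action repeats. This form of temporal abstraction does not learn a top-down hierarchy of pertinent task structures, but rather a bottom-up composition of coupled actions and action repetitions. This reduces the number of decisions required in control without learning a hierarchical policy. As such, t-SR directly considers the time horizon of temporally extended action sequences without the need for predefined or domain-specific options. We show that in environments with dynamic reward structure, t-SR is able to leverage both the flexibility of the successor representation and the abstraction afforded by temporally extended actions. Thus, in a series of sparsely rewarded gridworld environments, t-SR optimally adapts learnt policies far faster than comparable value-based, model-free reinforcement learning methods. We also show that the manner in which t-SR learns to solve these tasks requires the learnt policy to be sampled consistently less often than non-temporally extended policies.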
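
The flexibility of the successor representation under changing rewards can be seen in a minimal sketch. This is the standard tabular successor representation for a fixed policy, not the t-SR construction over action repeats described in the abstract. The three-state ring environment, the discount `gamma`, and all function names are illustrative assumptions. The matrix M satisfies M = I + gamma * P * M, and state values follow as V = M r, so a new reward vector needs no relearning of M.

```python
def successor_representation(P, gamma, iters=200):
    """Iterate M <- I + gamma * P @ M for a policy-induced transition matrix P."""
    n = len(P)
    M = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(iters):
        M = [
            [
                (1.0 if i == j else 0.0)
                + gamma * sum(P[i][k] * M[k][j] for k in range(n))
                for j in range(n)
            ]
            for i in range(n)
        ]
    return M


def state_values(M, reward):
    """Values are linear in the reward: V = M @ r."""
    return [sum(m * r for m, r in zip(row, reward)) for row in M]


# Hypothetical 3-state ring; the fixed policy always moves to the next state.
P = [
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0],
]
M = successor_representation(P, gamma=0.5)

# The reward moves from state 2 to state 1; M is reused unchanged.
values_before = state_values(M, [0.0, 0.0, 1.0])
values_after = state_values(M, [0.0, 1.0, 0.0])
```

Only the final matrix-vector product depends on the reward, which is why methods built on the successor representation can adapt quickly when the reward location changes.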

Deep learning at the edge enables real-time streaming ptychographic imaging

Anakha V Babu , Tao Zhou , Saugat Kandel , Tekin Bicer , Zhengchun Liu , William Judge , Daniel J. Ching , Yi Jiang , Sinisa Veseli , Steven Henke

Category: Machine Learning

2022-09-20

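Coherent microscopy techniques provide an unparalleled multi-scale view of materials across scientific and technological fields, from structural materials to quantum devices, and from integrated circuits to biological cells. Driven by the construction of brighter sources and high-rate detectors, coherent X-ray microscopy methods such as ptychography are poised to revolutionize nanoscale materials characterization. However, the associated significant increases in data and compute needs mean that conventional approaches no longer suffice for recovering sample images in real time from high-speed coherent imaging experiments. Here, we demonstrate a workflow that leverages artificial intelligence at the edge and high-performance computing to enable real-time inversion of X-ray ptychography data streamed directly from a detector. The proposed AI-enabled workflow eliminates the sampling constraints imposed by traditional ptychography, allowing low-dose imaging using orders of magnitude less data than traditional methods require.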

Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality

Tiffany J. Callahan , Adrianne L. Stefanski , Jordan M. Wyrwa , Chenjie Zeng , Anna Ostropolets , Juan M. Banda , William A. Baumgartner Jr. , Richard D. Boyce , Elena Casiraghi , Ben D. Coleman

Category: Artificial Intelligence

2022-09-10

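Common data models solve many challenges of standardizing electronic health record (EHR) data, but are unable to semantically integrate the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide semantically computable representations of biological knowledge and enable the integration of a variety of biomedical data. However, mapping EHR data to OBO Foundry ontologies requires significant manual curation and domain expertise. We introduce a framework for mapping Observational Medical Outcomes Partnership (OMOP) standard vocabularies to OBO Foundry ontologies. Using this framework, we produced mappings for 92,367 conditions, 8,615 drug ingredients, and 10,673 measurement results. Domain experts verified the mapping accuracy, and when examined across 24 hospitals, the mappings covered 99% of conditions and drug ingredients and 68% of measurements. Finally, we demonstrate that OMOP2OBO mappings can aid in the systematic identification of undiagnosed rare disease patients who might benefit from genetic testing.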

Developing moral AI to support antimicrobial decision making

William J Bolton , Cosmin Badea , Pantelis Georgiou , Alison Holmes , Timothy M Rawson

Category: Artificial Intelligence | Machine Learning

2022-08-12

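Artificial intelligence (AI) that assists with antimicrobial prescribing raises significant moral questions. Using ethical frameworks alongside AI-driven systems, while considering the specific complexities involved, can support moral decision making to tackle antimicrobial resistance.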

Knowledge-Driven Mechanistic Enrichment of the Preeclampsia Ignorome

Tiffany J. Callahan , Adrianne L. Stefanski , Jin-Dong Kim , William A. Baumgartner Jr. , Jordan M. Wyrwa , Lawrence E. Hunter

Category: Artificial Intelligence

2022-07-28

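Preeclampsia is a leading cause of maternal and fetal morbidity and mortality. Currently, the only definitive treatment for preeclampsia is delivery of the placenta, which is central to the pathogenesis of the disease. Transcriptional profiling of human placenta from pregnancies complicated by preeclampsia has been performed extensively to identify differentially expressed genes (DEGs). DEGs are identified using unbiased assays; however, the decisions to investigate DEGs experimentally are biased by many factors, leaving many DEGs uninvestigated. A set of DEGs that are associated with a disease experimentally, but have no known association with the disease in the literature, is known as the ignorome. Preeclampsia has an extensive body of scientific literature, a large pool of DEG data, and only one definitive treatment. Tools that facilitate knowledge-based analyses, and that can combine disparate data from many sources to suggest underlying mechanisms of action, may be a valuable resource to support discovery and improve our understanding of this disease. In this work we demonstrate how a biomedical knowledge graph (KG) can be used to identify novel preeclampsia molecular mechanisms. Existing open-source biomedical resources and publicly available high-throughput transcriptional profiling data were used to identify and annotate the function of currently uninvestigated preeclampsia-associated DEGs. Experimentally investigated genes associated with preeclampsia were identified from PubMed abstracts using text-mining methodologies. The relative complement of the text-mined and meta-analysis-derived lists was identified as the uninvestigated preeclampsia-associated DEGs (n = 445), i.e., the preeclampsia ignorome. Using the KG to investigate relevant DEGs revealed 53 novel clinically relevant and biologically actionable mechanistic associations.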
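
The relative-complement step can be sketched as a set difference: genes found by the transcriptional meta-analysis minus genes already linked to the disease in the literature. The gene identifiers and the function name below are hypothetical placeholders, not data from the study.

```python
def ignorome(meta_analysis_degs, text_mined_genes):
    """DEGs with experimental support but no literature link to the disease."""
    return set(meta_analysis_degs) - set(text_mined_genes)


# Hypothetical identifiers for illustration only.
meta_analysis_degs = {"GENE_A", "GENE_B", "GENE_C", "GENE_D"}
text_mined_genes = {"GENE_B", "GENE_D", "GENE_E"}

uninvestigated = ignorome(meta_analysis_degs, text_mined_genes)
```

Note the direction of the difference: a gene that appears only in the text-mined list (here `GENE_E`) is not part of the result, because it lacks support from the meta-analysis.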

SP2: A Second Order Stochastic Polyak Method

Shuang Li , William J. Swartworth , Martin Takáč , Deanna Needell , Robert M. Gower

Category: Machine Learning

2022-07-17

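Recently, the "SP" (Stochastic Polyak step size) method has emerged as a competitive adaptive method for setting the step sizes of SGD. SP can be interpreted as a method specialized to interpolated models, since it solves the interpolation equations. SP solves these equations by using local linearizations of the model. We take this a step further and develop a method for solving the interpolation equations that uses a local second-order approximation of the model. Our resulting method, SP2, uses Hessian-vector products to speed up the convergence of SP. Furthermore, and rather uniquely among second-order methods, the design of SP2 in no way relies on positive definite Hessian matrices or on convexity of the objective function. We show that SP2 is very competitive on matrix completion, non-convex test problems, and logistic regression. We also provide a convergence theory on sums of quadratics.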
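
The first-order SP update that SP2 builds on can be sketched on a toy problem. This is a minimal illustration of the Stochastic Polyak step, not SP2 itself: no Hessian-vector products are used. It assumes per-sample losses f_i(w) = 0.5 (a_i . w - b_i)^2 on a consistent linear system, so interpolation holds and every f_i has minimum value 0. The step size is then f_i(w) / ||grad f_i(w)||^2. The data and function names are illustrative assumptions.

```python
import random


def sp_step(w, a, b):
    """One Stochastic Polyak step for f_i(w) = 0.5 * (a.w - b)^2, with min f_i = 0."""
    residual = sum(ai * wi for ai, wi in zip(a, w)) - b
    loss = 0.5 * residual * residual
    grad = [residual * ai for ai in a]
    grad_sq = sum(g * g for g in grad)
    if grad_sq == 0.0:
        return w  # this sample is already interpolated
    step = loss / grad_sq  # Polyak step size under interpolation
    return [wi - step * gi for wi, gi in zip(w, grad)]


def run_sp(data, dim, epochs=200, seed=0):
    """Cycle through the samples in random order, applying SP steps."""
    rng = random.Random(seed)
    w = [0.0] * dim
    for _ in range(epochs):
        for a, b in rng.sample(data, len(data)):
            w = sp_step(w, a, b)
    return w


# Consistent toy system with solution w_true, so all losses can reach zero.
w_true = [2.0, -1.0]
data = [
    ([1.0, 0.0], 2.0),
    ([0.0, 1.0], -1.0),
    ([1.0, 1.0], 1.0),
]
w_fit = run_sp(data, dim=2)
```

For this quadratic loss the step works out to 0.5 / ||a_i||^2, which moves the iterate halfway to the set where the sampled equation holds exactly. No learning rate needs to be tuned by hand.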
